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Counting pairs is the oldest trick in combinatorics... Every time we count pairs, we learn 
something from it. 
Gil Kalai 

Abstract. Higher-dimensional Dedekind sums are defined as a generalization of a recent 1-dimensional 
probability model of Dilcher and Girstmair to a d-dimensional cube. The analysis of the frequency 
distribution of marked lattice points leads to new formulae in certain special cases, and to new 
bounds for the classical Dedekind sums. Upper bounds for the generalized Dedekind sums are 
defined in terms of 1-dimensional moments. In the classical two-dimensional case, the ratio of these 
sums to their upper bounds are cosines of angles between certain vectors of n-dimensional cones, 
conjectured to have a largest spacial angle of tt/6. 



1. Introduction 

Historically, Dedekind sums first appeared in Dedekind's transformation law of his ^-function [2]. 
Dedekind sums have since become an integral part of combinatorial geometry (lattice point enu- 
meration [8]), algebraic number theory (class number formulae [7]), topology (signature defects of 
manifolds [4]), and algorithmic complexity (pseudo random number generators [5]). We begin by 
defining the classical Dedekind sum, whose basic ingredient is the sawtooth function 

((*)) = ( {X} ~^ * x *l 

Here {x} = x — [x\ denotes the fractional part of a real number x. 

For any two positive integers a and b, we define the classical Dedekind sum as 

k mod fe VV / / \ \ / / 

Here the sum is over a complete residue system modulo b. 

The classic introduction to the arithmetic properties of the Dedekind sum is [9]. The Dedekind 
sums have recently been cast in a new light as essentially the second moments of an appealing 
probability model introduced by Dilcher and Girstmair [3] . They divide an interval of length a into 
b equal subintervals ( "boxes" ) and count the number of integers in each subinterval. 
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We generalize their approach by considering a (i-dimensional cube (d > 2) of side length a £ N. 
Along the d-dimensional main diagonal we mark the points with integer coordinates. The cube 
is now partitioned to 6162 •■■ b d rectangular parallelopipeds by dividing the j th side of the cube 
to bj equal length intervals. Each parallelopiped is given the coordinates (ji,j 2 , • • • , 3d) where 
jk = l,...,bk (k = l,...,d). Let f a ;b u ...,b d (ji, ■ ■ ■ ,3d) denote the number (frequency) of marked 
points along the main diagonal which belong to the j 2 , . . . , jd)-parallelopiped. The generalized 
Dedekind sums under consideration are 



61-1 



(1.2) 



S d (a; b) = S d (a; 61, b 2 , ■ ■ ■ , b d ) = - ^ • • • ^ h ■ ■ ■ k d f a ;b u ...,b d {ki,...,k d ), 



k 1= k d =0 



a mixed moment for the f a -bi,...,b d distribution. In Section 2 we present basic definition and facts. 
We show that 



(1.3) 



S d (a;h) 



1 a— 1 

i E 



mbi 




mb d 


a 




a 



For the case d = 1 we define the k th moment as 



a-l 



(1.4) 



M k (a;b) 



m=0 



mb 
a 



These moments are used in Sections 5 and 6 to provide, via the Cauchy-Schwartz inequality, upper 
bounds for S d (a; b). 

The second moment M2(a;b) is of special importance since it is connected with the classical 
Dedekind sum s(a,b) according to the formula (see Section 2) 



(1.5) 



M 2 (a;b) 



(6 2 + l)(a-l)(2a-l) (a - 1)6 26 



6a 2 



2a 



s(b, a) . 

a 



By analyzing the structure of the univariate and bivariate frequency distributions (Section 3), we 
derive in Section 4 new formulae for M2(a;b), in some special cases, and provide several types 
of lower and upper bounds. In Section 5 we analyze the ratio of 52(0561,62) to its upper bound 

1 /2 

{M 2 {a\ 6i)M2(a; 62)) . All these ratios ^2(0561,62) are empirically found to be greater or equal 
to i?2(5;2, 3) = \/3/2, leading to the following conjecture (see section 5.1 below): 



Conjecture 1. For all a, 6, c > 3, 
(1.6) 



^/3 

R 2 (a;b,c)>R 2 (5;2,3) = ^- . 



Geometrically, S 2 (a;bi,b 2 ) is an inner product of the vectors v& 1 = 



( 


k 




2&i 




(o-l)6i 


) 




a 


, 


a 


, . . . , 


a 





and vj 2 = |^ k ; _2 ^ a J in M a 1 , and R 2 (a; 61, 62) is the cosine of the angle between 
these two vectors. It appears from both empirical and theoretical evidence that all these vectors, 
for a > 3, 61, 62 > 2, are within a cone with largest possible angle of cos _1 (\/3/2) = 7r/6. In Section 
5.1 we have some general results and observations on the functions R 2 (a;b±,b 2 ). In Section 5.2 we 



analyze the geometry of the vectors v;, 



(q-l)b 
a 



^ and prove several lemmas, 
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which lend further credence to the validity of Conjecture 1. Finally, in Section 6 we present higher- 
dimensional upper bounds for Sd(a;h) in terms of the r'th moments M r (a;b), and prove that 
M r (a; b) is log-convex in r. 



2. Generalizing the Dilcher-Girstmair model 



We introduce the ci-dimensional analog of the Dilcher-Girstmair model. We begin gently with the 
two-dimensional extension: given three positive integers a, b, and c, divide one of the sides of the 
square [0, a) x [0, a) into b parts of length a/b, and the other side into c parts of length a/c. This 
division induces a grid (see Figure 1 for an example). We thus have be boxes of equal size. We 



Figure 1. a = 6, 6 = 5, c = 7 

think of each box as half open: we count the left (excluding the highest point) and bottom side 
(excluding the right-most point) as belonging to the box. Let's mark each box by a pair of integers 
(j, k) where < j < b — 1 and < k < c— 1. We will study the integer lattice points in the square; 
note that the box (j, k) contains the point (m, n) G I? if and only if 



(2.1) 



— < m < 

b b 



ka (k + l)a 
and — < n < 

c c 



Equivalent to this condition is the following: 

mb 

J < — < 3+1 
a 

which can be rewritten in compact form using the greatest integer function [x\ (the greatest integer 
not exceeding x): 



nc , 

and k < — < k + 1 , 
a 



(2.2) 



mb 



and 



A: 



nc 
a . 



We formalize the distribution of integer points within each of the be boxes as follows. 

Definition 1. Let f a bc(j\k) denote the number of marked lattice points on the diagonal in the box 
(j,k). 



Notice that most of these frequencies are zero. We can evaluate the following sum in two ways 
according to the equivalence of (2.1) and (2.2): 

6-1 c-1 a-1 

(2.3) EE^w^) = E 

j=0 k=0 m=0 

A special case of this is b = c, for which we make the following definition. 

Definition 2. Let f a -b(j) denote the number of marked integers in the j th subinterval when we 
divide [0, a) into b equal parts. 



mb 




mc 


a 




- a - 
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We remark that f a -,b(j) = fa;b,b(j)- When b = c the double sums reduce to the one-dimensional 
sums studied by Dilcher and Girstmair, i.e. 

6-1 6-1 6-1 

(2.4) EE^' fc = Y,i 2 u(j) = E 

j=0 k=0 j=0 m=0 

The sum on the right hand is essentially a classical Dedekind sum: If a and b are relatively prime, 



a-1 2 

mb 



a-1 

E 

m=0 L 



mb 
a 



a-1 

= E 

m=l 

a-1 



mb 
a 



mb 
a 

a-1 



a-1 



a 



m=l 



m=l 



m 
a 



m=l 

mb 
a 



m=l x ' m 
,o a— 1 a— 

E™ - 26 E 



2b V - + — i- ^ + E 

m=l m=l 



y + 1 )(a 6 -l ) (2a-l ) _ 2ta(M) _l Ma _ 1) 



This and similar sums coming from the one-dimensional case will appear repeatedly in the exposition 
that follows. 



Definition 3. For any two positive integers a and b, let 

^ a— 1 

M k {a;b) = -J2 



a-1 , k 
mb 



m=0 



Mk is the k th moment of the Dilcher-Girstmair probability distribution (see Section 6). By (2.4), 
the definition of Mj, is equivalent to 

1 b ^ 

M k (a;b) = -Y,j k fa;bU) ■ 

tt 3=0 

We just showed above that corresponds to the classical Dedekind sum s(a, b) as in (1.5). 

The model that we described above extends naturally to higher dimensions. Instead of considering 
a square, let's divide the d-dimensional cube [0, a) x • • • x [0, a) into b\ ■ ■ ■ b^ equal boxes by a similar 
construction as above: now we divide the first side into b\ equal intervals, the next one into 62 equal 
intervals, and so on. Again we will count the number of marked integer lattice point on the main 
diagonal of this cube, according to the box they are in. As above we will label each box, say by 
(ki, . . . , kd) , < kj < bj — 1, and we will denote the function counting the lattice points in box 
(ki,...,k d ) by 

fa;b u ...,b d (h, ■ • • , fed) • 
As before, an elementary counting-two-ways argument yields 

61 — 1 brf — 1 a— I 

E E k l--- k dfa-M,...,b d (h,...,k d ) = ^ 
fci=0 k d =0 m=0 

This naturally leads to the following definition. 



mb\ 




mb d 


a 




a 
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Definition 4. For positive integers a, b±, 62, ■ ■ ■ , b d , we define 

^ a— 1 

S d (a; b) = S d (a; b u b 2 , . . . , b d ) = - V 

n L — * 



This is a generalized Dedekind sum. Our goal is to find relations for the sums S d (a; b). 



3. The two-dimensional frequency distribution {f a -b,c(j, k)} and its marginal 

DISTRIBUTIONS 

In this section we focus on the study of the distribution frequencies f a -b,c(j, k) using the duality 
interpretation given by (2.3). It appears impossible to derive a closed formula for the number of 
diagonal lattice points that belong to the (J, k)th rectangle, that is, f a -,b,c(ji k), j = 0, . . . , b— 1, k = 
0, ...,c— 1. We developed an algorithm, given in the appendix, for computing the values of 
fa-,b,c(j> k) and of the marginal frequencies 

c-l 6-1 

fa;b(j) = /o;6,c(j. k ) an d fa;c( k ) = f>&M> k ) ■ 

k=0 j=0 



mb\ 




mb d 


a 




a 



Example. In Table 2 we present these distributions for the case a = 50, b = 13, c = 7. From this 
table we can immediately verify that 



12 6 49 
^^^j/50;i3,7(i,A;) = Y 

j=0 k=0 m=0 



13m 



7m 
~50 



1236 



j\k 
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2 


3 


4 


5 


6 
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1 
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3 





3 
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4 
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2 
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3 


7 











2 
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4 


8 














4 








4 


9 














1 


3 





4 


10 

















4 





4 


11 




















4 


4 


12 
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3 


f a dk) 


8 


7 


7 


7 


7 


7 


7 


50 



Figure 2. The two-dimensional distribution and its marginals for a = 50, b = 
13, c = 7 
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By analyzing the structure of the marginal distributions we can arrive at closed formulae for 
Mk(a;b). For example, one can immediately verify that 

a 



and 
(3.1 



fa;b{j) = n 
fa;bU) = 



j = 0,...,b-l 



Thus for a = 0, 1 mod b we immediately obtain 



if a = mod b , 



if a = 1 mod b 



M k (a;b) 



1 I a 



6-1 



if J E>* ■ 



In general, the one-dimensional frequencies can be bounded as 



(3.2) 



a 
lb 



< fa;b(j) < 



A book-keeping device that will help us keep track of the difference between the frequency and [|J 
is the following. 

Definition 5. I a . b (j) = f a , b (j) - [f J . 



Notice that by (3.2) we have I a - t b{j) = 0, 1 for all a, b, j = 0, . . . , b — 1. Accordingly we rewrite the 
k th moment of the Dilcher-Girstmair distribution as follows. 



6-1 6-1 



(3-3) M k (a;b) = -i [£J £j* + £// a;& (j) . 

\ i=i i=i / 

The second sum allows for a finer analysis of these moments. A trivial example follows from the 
fact that I a -b(j) > 0: 



M i( «;6)>i[|jE;'. 



6-1 



This bound gets achieved, for example, when a = 0, 1 mod 6. In the following section, we study 
Ia-,b{j) and its second moments. 



4. Some Formulae and bounds for M 2 



Of special interest is M2(a;6), due to its relationship to the Dedekind sum s(a,b). According to 
(3.3) 

6-1 

(4.1) aM 2 {a-b) 



(6 - 1)6 ( 2t _ 1) +E/Mj) 



One may think about this identity in terms of the Dilcher-Girstmair distribution model: Among 
the a integers in [0, a), we have at least [a/b\ of them in each interval 

ka (k + l)a N 
T' b~ 



k = 0, ...,6-1 . 
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These integers are represented in the first term on the right-hand side of (4.1). Suppose a = I 
mod 6 where < I < b (the case b\a is special and very easy to handle: I a -,b(j) = for all j); 
then there are I — 1 integers "left" which haven't been accounted for (note that the first interval 
[0, a/6) contains [a/b\ + 1 integers). These I — 1 integers are represented in the second term on 
the right-hand side of (4.1). In fact, one can say more about them. Because they are uniformly 
distributed among the b intervals, we obtain 



(4.2) 



D 2 (a;b) = b -£fl a ., b (j) = ^ 

3=1 



m=l 



mb 

T 



lM 2 (l;b) , 



I > 2. 



Note that, in particular, D 2 (a;b) depends on a only via I = a mod b. In special cases of a = I 
mod b, we can obtain closed formulas for D 2 (a; b), given in the following theorem. 

Theorem 1. Let I = a mod b then D 2 (a; b) is given by the following formulae: 



I 



0,1 

2 
3 



D2(a;b) 




m 2 2 

s LSJ +4LIJ + 1 

14L|j 

14 LIJ + 10 L!J + 2 
i4L|j%i6L!j+5 

30LIJ 2 



3 °[|J 2 + 14 LIJ + 2 

30 L|J 2 + 26 LIJ +6 
30 L|J 2 + 40 L|J +14 



ifbi 
ifb- 
ifb- 
ifbi 
ifbi 
ifb; 
ifbi 
ifbi 
ifb; 



0, 1 mod 3 
2 mod 3 
0, 1 mod 4 

2 mod 4 

3 mod 4 
0, 1 mod 5 

2 mod 5 

3 mod 5 

4 mod 5 



Proof. This follows directly from (4.2). 



□ 



In general, one can use (4.2) to obtain inequalities for D 2 (a;b) and hence for M 2 (a;b). To this 
extend, we use the fact that 



I > 1 





b 




mb 


2 


/ mb\ 2 






) 2 <- 




< 








_ I _ 







which implies the following bounds for I > 2. 



2 l-l 

E 

m=l 



m 



(I - 1)1(21 - 1) 



< D 2 (a-b) < 



by (l - 1)1(21 - 1) 



'-1 / L\ 2 

mb 



E 

.m=l 



Accordingly, we have the following: 
Theorem 2. For a = I mod b 



(4.3) 



M 2 (a;b) 



(6-1)6(26-1) 



if I = 0,1, 
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and for I > 2 

(4.4) 

and 

(4.5) 



M 2 (a;b) > 
M 2 (a;b) < 



(b-l)b(2b-l) 
6 



(6-1)6(26-1) 
6 



+ 



+ 



(I - 1)1(21 - 1) 
6 



b 2 (l- l)(2l- 1) 
61 



Naturally, (4.2) can be refined further to give even better bounds. We illustrate one further step 
here. Suppose as before that a = I mod 6 where 1 < I < b, and moreover that 6 = \b/l\ I + k, so 
that < k < I — 1. According to (4.2) this gives 



l-i 



m=l 



2 l-l 



m=l 



m([b/l\l + k) 



(I - 1)1(21 - 1) 



6 



+ 2 



l-i 



2 l-l 

= £ 

m=l 

mk 



m 



+ 



mk 



m=l 



E llbK sr-^ 
m — +^ 



l-l , 2 

mk 



m=l 



Lemma 3. For all I > 2 



D 2 (a;b) 



(I - 1)1(21 - 1) 



if k = 0,1, 



(4.6) D 2 (a;b) 



For example, if / = 4, k = 2 



and if I = 5, k = 2 



(I - 1)1(21 - 1) 
6 

02(4,6) 
£> 2 (5,6) 



+ 





6 

"I" 


{ 


J. 



((/-l)/-i/(/-2))+l i/ 

a-i ) z-Llj(i+LIJ) + LIJ ^ 



/ is even, 
I is odd. 



3-4-7 



+ 10 



4-5-9 



6 



+ 14 



+ 2 



+ 2 



as stated in Theorem 1. Finally, since [^rJ > [^r J f ° r ah £; > 2, the above formula of £> 2 (a; 6) for 
/c = 2 is a lower bound. 

Similar bounds can be derived "classically" by applying Dedekind's famous reciprocity law: 

Theorem 4 (Dedekind). If a and 6 are relatively prime then 

, > „n / ,\ /, n 1 1 /a 1 6 

(4.7) S (a,6) + S (6,a) = - i + - ^ + ^ + - 



Denote the rational function appearing in Theorem 4 by 
(4.8) 

Then we obtain for a = / mod 6, where a and 6 are relatively prime and 1 < I < 6, 
s(6, a) = i?(a, 6) - s(a, b) = R(a, 6) - s(l, b) = R(a, 6) - R(b, I) + s(b, I) . 



n , n 1 1 /a 1 6". 



HIGHER-DIMENSIONAL DEDEKIND SUMS AND THEIR BOUNDS 
It is well known (and a straightforward exercise) that 



9 



(4-9) | S(M )|< S ( M) = A_l+i_, 



which gives the following bounds: 

(4.10) R(a, b) - R(b, I) - s(l, I) < s(b, a) < R(a, b) - R{b, I) + s(l, I) . 

These inequalities, in turn, can be transformed into inequalities for M2 via (1.5), to obtain: 
Theorem 5. Lower and upper bounds for M2 are: 



(4.1!) M 2 (a; t) > ig + " ^ ~ " - <izi» - » ( fl(o , 6) - R( b , ,) + »(1, ,)) 

ba z za a 



(4.12) M 2 („; ») < < 62 + 1 "7 a2 1 « 2 -" - ^ - | (*(«, 6) - ^ - .(1,0) 



In the following table we give the exact values of aM2(a; b) and their lower bounds. We denote by 

flbl the lower bound according to (4.4), 
flb2 the lower bound according to (4.6), 
rib the lower bound according to (4.11), 
fub the upper bound according to (4.5), 
rub the upper bound according to (4.12). 



Note that we can compute rib and rub only when a and b are relatively prime. 
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a 


b 


exact 


flbl 


flb2 


rib 


fub 


rub 


5 


2 


2 


2 


2 


2 


2 


2 


5 


3 


6 


6 


6 


6 


6 


6 


5 


4 


14 


14 


14 


14 


14 


14 


6 


2 


3 


3 


3 




3 




6 


3 


10 


10 


10 




10 




6 


4 


18 


18 


18 




18 




6 


5 


30 


30 


30 


30 


30 


30 


7 


2 


3 


3 


3 


3 


3 


3 


7 


3 


10 


10 


10 


10 


10 


10 


7 


4 


19 


19 


19 


19 


19 


19.9 


7 


5 


34 


34 


34 


34 


34 


34 


7 


6 


55 


55 


55 


55 


55 


55 


35 


7 


455 


455 


455 




455 




39 


7 


490 


469 


481 


486.5 


497 


490 


40 


7 


501 


485 


501 


498.2 


513 


503.8 


41 


7 


510 


510 


510 


510 


529 


517.8 


10 


3 


15 


15 


15 


15 


15 


15 


11 


3 


16 


16 


16 


16 


16 


16 


21 


6 


185 


185 


185 




185 




20 


6 


174 


174 


174 




174 




11 


7 


126 


105 


117 


122.5 


133 


126 


10 


9 


204 


204 


204 


204 


204 


204 


11 


9 


220 


220 


220 


220 


220 


220 


12 


9 


249 


249 


249 




249 




13 


9 


260 


260 


260 


260 


274 


264.5 


14 


9 


288 


234 


250 


280.8 


301 


288 


15 


9 


315 


259 


286 




327 




16 


9 


328 


295 


328 


322.9 


354 


335.7 


17 


9 


344 


344 


344 


344 


381 


359.75 


24 


10 


648 


626 


648 




657 





5. Bounds for generalized Dedekind sums: the case d = 2 



5.1. Applications of the Cauchy-Schwartz inequality. In the present section we discuss some 
relationships between the S and the M-functions. By definition, if b\ = ■ ■ ■ = b d 

S d {a-bl d ) = M d {a-b) , d = l,2,... 

Here l d denotes the ci-dimensional vector all of whose components are 1. The Cauchy-Schwartz 
inequality yields immediately, for d = 2, the inequality 

5 2 (a; 61,62) < (M 2 (a;6 1 )M 2 (a;6 2 )) 1 / 2 , 

with equality if and only if b± = 6 2 . Let 

£2(0; 61,62) 



(5.1) R 2 {a; 61, 6 2 ) 



y / M 2 (a;b 1 )M 2 (a;b 2 ) 
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In the following table we give a few values of R 2 (a; &i, 62)- 



a 


b 2 


R 2 (a;2,b 2 ) 


R 2 (a;3,b 2 ) 


11 


7 


0.9163 


0.9799 


21 


5 


0.9237 


0.9721 


18 


11 


0.9297 


0.9729 


73 


39 


0.9189 


0.9695 


99 


33 


0.9192 


0.9707 



It is interesting to observe in this table that all these R 2 (a; b, c)-values are close to 1, and that 
among these values R 2 (a;2,b 2 ) < #2(0; 3, 62). The question is whether this inequality is always 
true. A partial answer is given in Lemma 10 of Section 5.2. Empirical evaluations lead us to the 
following conjecture: 

Conjecture 1. For all a,b,c> 3, 

(5.2) R 2 (a;b,c)> R 2 (5; 2,3) = ^- . 



( 


c 




2c 




(o-l)c 




-d- 


5 


a 


, . . . , 


a 



Notice that according to the previous definitions, R 2 (a; b, c) is the cosine of the angle between the 

two vectors 

( b 2b (a - 1)6 \ 

v fe = - , — , . . . , and v c = 

\\_aj \_ a j a j J 

In section 5.2 we present the geometrical correspondence, which is utilized to obtain further results. 
Exact formulae can be derived for R 2 (a; 2, o), a > 3. Indeed 

< 77, i.e., a is odd, 



(5.3) 
Moreover 

and 

Accordingly 
(5.4) R 2 (a;2,a) 
where 
(5.5) 



S 2 (a;2,a) = 



tt-l _ |a/2](l+|a/2]) - f 
2 2a 11 

^ - i (I - 1) if 



2, i.e., a is even. 



M 2 (a;2) 



1 



M 2 (a;a) 



2a 2 - 3a + 1 



#5(a)(a-l-l/a|a/2j(l + La/2j)) if 
i?2» (a - 1 - l/2(a/2 - 1)) if 



R*(a) = 



< |, i.e., a is odd, 
= |, i.e., a is even, 



6a 



2 v / [a/2j (2a 2 -3a + 1) 
A graph of R 2 (a; 2, a) for a = 3, ... , 50 is given in Figure 3. Notice that lim a _ >00 i?2(a; 2, a) = 



We provide here a few auxiliary results. First, if c = I + ia (i.e., a = / mod c) then 

a— 1 , /7 , . x a— 1 

i > m 



(5.6) 



S 2 (a;b,c) = ^ 



m=0 



bra 
a 



(I + m)m 



a 



E 

m=l 



6m 



a 



+ 5 2 (a; 6, /) . 
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CM 
CC 



10 



20 30 
a 



40 



50 



Figure 3. #2(0; 2, a) for a = 3, ... , 50 



Similarly, 

aM 2 (a;l + ia) = ^ 
Accordingly, 
(5.7) 



a-l 



m=0 



(I + ia)m 



where 

Thus, 
Now, 

and 



R 2 (a; b, I + ia) 



n(a— l)a(2a — 1) ^— i 
6 ^ 

m=l 



Egi^M+^W) 

Di 



Im 



+ aM 2 (c 



a-l 



, 1 r (a - l)o(2a - 1) 2 v-^ 
A = « I aM 2 (a; &p ^ '- + - ^ 



6 



m 



m=l 



1/2 



+ -M 2 (a,Z) 



lim R 2 (a;b,c) = lim i? 2 (a; 6, Z + ia) = R 2 (a;b,a) 



a-l 



aS 2 (a;ja,a) = ^ 



mja 



m=0 



ma 



a-l 



./ X! " ; 

m=0 



2 .(a-l)o(2a-l) 



a-l 



aM 2 (a;ja) = j 2 



m = j 



m=0 



2 (a - l)a(2a - 1) 
6 
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whence 

R-2,(a\ ja, a) = 1 for all j > 1 . 
We consider now i?2(a; 6, a) with a — > oo. Let a = j6. For j > 2 



R 2 (jb;b,jb) = 



Y^jfe-l 



jft-1 



2 \ V2 

2 (o-lWga-l) 



m=j 



m 

Li 

£ 

m=j 



6-1 (H-l)j-l 

S z Z m 

1=1 m=lj 



m 
3 . 



6-1 



3^=1 



_ 6j(6-l)(46j+j-3) 
12 

(6-1)6(26-1) 



i=i 



6 



Thus 
(5.8) 

D *m r t-j / .j i . t\ ,. (6-l)((46 + l)j-3) v/2(6-l)(46 + l) 

#2(6) = hm R 2 (jb;b,jb) = lira — - - — - = ) M = 

i-^2((6-l)(26-l)(j6-l)(2j6-l)) 1/2 46V(& - 1)(26 - 1) 
Some values of the limit are given in the table below. 



6 


i?*(6) 


2 


0.918558 


3 


0.96896 


4 


0.9836 



Figure 4. Some values of R* 2 {b) = lim^oo -R2C/6; b,jb) 



5.2. A Geometric correspondence. We have seen that the Dedekind-like sums S2(a;b,c) and 
M2(a; 6) can be considered as inner products in M a_1 . Thus, for a given integer a > 3, we construct 
a polyhedral cone C a C M a_1 that is defined by the positive real span of the vectors 



( 


6 




26 




(a - 1)6 


) 




a 




_ a _ 




a 





1 < 6 < 00. 



As stated in the introduction, the significance of these vectors is that ^2(0; 6, c) is the cosine of the 
angle between the two vectors v& and v c . The observation that #2(0; 6, c) is close to 1 is captured 
geometrically by the statement that this cone C a is thin in the angular metric. 

Notice that vi = = (0, . . . , 0) and v a = (1, 2, . . . , a — 1). For 6 = 2, 3, . . . , a — 1, the vectors v& are 

v 2 = (0,...,0,l,...,l),v 3 = (0,...,0,l,...,l,2,...,2),...,v o _i = (0,1,..., a -2), 
where each vector Vj has almost equally distributed values for the integers the comprise its entries. 

For 6 > a, we write 6 = ka+l, k > 0, < I < a, and it follows from our notation that = /cv a +v/. 
Thus the cone C a is in fact the positive real span of only the a — 1 vectors with 6 = 2, 3, . . . , a. 
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Since vi = 0, we have v/% a = Vka+i- Moreover, if Pi (2 < I < a — 1) denotes the 2-dimensional plane 
containing the vectors V/ and v , then all the vectors v^a+i, k = 0, 1, . . . belong to Pi. Notice that 
for different values of I, say I and I' ^ I, Pi and Py are two different planes which have the ray 

{rv a : r > 0} 

in common (see Lemma 6). Throughout this section, the denominators in the vector components 
of all the vectors v m are always the integer a. 




Figure 5. v a , v 6 + Zcv a (0 < k < 3), v c + jv a (0 < j < 3) 

The vectors V2, . . . , v a are not always linearly independent. One can easily check that if a = 3, 4, 6 
then these vectors are linearly independent, and when a = 5, 7, 8, 9, . . . they are not. However, one 
can prove the following: 

Lemma 6. For each a > 4, the vectors v a ,v/,v^, where 1 < 1,1' < a and I ^ I', are linearly 
independent. 

Proof. For each a > 4, v = (1, 2, . . . , a — 1), while the first component of both and v// is zero; 
thus v a is linearly independent of {v^, Vj/}. Moreover, and Vj/ do not lie on the same ray. □ 

Lemma 7. For each a > 3, ifb\ = k\a + Z and 62 = + ^ where 1 < fei < &2, i/ien 

R 2 (a;l,a) < i? 2 («;Z,6 2 ) < #2(0; Z, »x) 

/or all 1 < I < a. 

Proof. The vectors and v^ 2 lie in Pj. Moreover, v& 1 = Zciv a + and Vft 2 = /c2V a + vj. Hence 

Z (v{, v a ) > Z (v bl , v a ) > Z (v 6l , v ) . 
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But Z (vft, vj) = Z (v a , v;) — Z (v a , v&), and the inequalites in the statement follow by taking cosines. 
□ 

Notice that due to the monotonicity stated in the last lemma, 

lim R 2 (a\ I, ka + 1) = R 2 (a; I, a) . 

k^oo 

It is interesting to notice that in the case of a = 3, all vectors vj, are between V2 = (0, 1) and 
V3 = (1, 2). The cosine of the angle between these two vectors is 2/y/b. 

Lemma 8. (i) If a = 3,5 then R 2 (a; 2, 6) < R 2 (a; 3, 6) for all b > 3. 
(ii) If a = A then R 2 (A;2,b) < i? 2 (4;3,6) for all b + 6. 7/6 = 6 then i? 2 (4;2,6) = 0.9708 and 
i? 2 (4;3,6) = 0.9647. 

Proof, (i) The case a = 3 follows immediately from Lemma 7. For a = 5 we have 

v 2 = (0,0,1,1), |v 2 | = V2 
v 3 = (0,1,1,2), |v 3 | = V6 
v 4 = (0,1,2,3), |v 4 | = Vl4 
v 5 = (1,2,3,4), \v 5 \ = Vw. 

Let (-, •) denote the inner product of two vectors. For any b = 5k + I, k = 1, 2, . . . , / = 2, 3, 4, 

R 2 {5;2,5k + l) < R 2 (5; 3, 5fc + I) 

if and only if 

■j= (V2,V 5fc+ ,) < ^= (v 3 , v 5fc+i ) . 

This is equivalent to 

>/6fc(V2,V 5 ) + V6(v 2 ,V,) < >/2fc(v3,v 5 ) + >/2(v 3 ,v,) , 

or 

fc (l3V2 - 7V6) > V6 (v 2 , v,) - \/2 (v 3 , vj) . 

Thus, for / = 2 the inequality is true for all k > 0.53; for I = 3 or 4 it is true for all k > 0. Notice 
that for / = 2 and k = 0, we get 6 = 2. 

(ii) For a = 4, if 6 = 4/c + 2 then the inequality is true for all k > 1.72. For this reason, the 
inequality between i?2(4; 2, 6) and i?2(4; 3, 6) is reversed. If 6 = 4k + 3 the inequality is true for all 
k>0. □ 

Empirical evidence suggests that -R 2 (o; 2, 6) < R 2 {a] 3, 6) for all a > 6 and 6 > 3. We do not give a 
formal proof. 

Lemma 9. For all a > 3, i?2(o; 2, a) < i?2(a; 3, a). 



Proo/. If a = 3 then i? 2 (3;3,3) > i? 2 (3;2,3). For all a > 4, we have to show that B(a) > A(a), 
where 

A(a) = R 2 (a;2,a)\v a \ and S(a) = R 2 (a; 3, a)|v a | . 
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a mod 2 


A 



1 

a mod 3 


a ( 3o 2 ) 
8 1 

(« +L -/2j)(a-l-La/2j) 



1 

2 


90 (13a 9) 
a(a - 1) - 3/2 [a/3j - 5/2 [a/3j 2 


2^0 - 1 - 7/2 [a/3j 
a(a - 1) - 1 - 7/2 [a/3j - 5/2 [a/3j 2 


V4a + 7 - 7 La/3J 



FIGURE 6. A(a), a = 0,1 mod 2 and 5(a), a = 0,1,2 mod 3 

The formulas for A(a), a = 0, 1 mod 2 and 5(a), a = 0, 1, 2 mod 3 are given in Figure 6. 

One can easily check in all six cases that B(a) > A(a). □ 
Lemma 10. For each a > 3 and each b, c > 2 

i?2(o; &j c ) > m i n (^2(0; Z', a),R,2(a; I, a), R2(a; l' , Z)) 
where b = I mod a, c = Z' mod a. 

Proof. If Z, Z' = 0, 1 mod a, both vj, and v c are on the ray i? a , and i?2(&; b, c) = 1. 

If Z = Z' = 2, . . . ,a — 1 then v& and v c belong to and i?2(«; b, c) > #2(0; Z, a). 

Finally, if Z / I' one establishes the inequality by comparing the arcs on the unit sphere correspond- 
ing to the angles. These are the arcs between the points on the sphere on the rays generated by 
v a , v;, v v , v 6 , and v c . □ 

To prove Conjecture 1 it suffices to show that 

min R2(a;b,c) >i?2(5;2,3) . 

a>3, fe,c>2 

Let R^a) = min R-2,{a\ b, c). According to Lemma 10, for each a > 3 

b,c>2 

i?|(a) = min min (i?2(a; Z ', a), #2(0; Z, a), i?2(«; Z', Z)) , 

Kl,I'<a 

whence 

min i?2(o; 6, c) =mini?5(a) . 

a>3, 6,c>2 a>3 

In Figure 7 we present a plot of i?|(a) for a = 3, ... , 35. We see that in this range, i?2 (5; 2, 3) is 
the minimum, lending further credence to conjecture 1. 
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10 



15 



20 



25 



30 



35 



Figure 7. P|(a) for a = 3, . . . , 35 



6. Upper bounds for generalized Dedekind sums: higher dimensions 

6.1. Probability models. We introduce now a probability space and random variables, whose 
(mixed) moments yield the S- and M-functions. Let be a d-dimensional discrete sample 
space, consisting of a d points, that is, 

V d a) = {(mi, ...,m d ): mj = 0, . . . , a - 1, j = l,...,d}. 



A point in is a ci-dimensional vector m = (mi, . . . , m^). Consider the probability function on 



V 



(«). 



(6.1) 



P(m) 



1/a if m = j = 0, . . . , a — 1 
otherwise, 



where 1^ = (1,1,1,... ,1). This probability function is concentrated on the main diagonal points 
of . Define the random variables 



(6.2) 



if m = mid, th = 0, . . . , a — 1, 



otherwise, 
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where b = (61, . . . , bd). It follows immediately that 

(6-3) S d (a; b) = E P jj] x[ a) (m; b) j , 

where Ep{ } denotes the expected value of the term in braces with respect to the probability 
function P. Moreover, 

(6.4) Muia-M) = E P j (xf } (m; b)) * j . 

Notice that Mfc(a;6j) is the k-th order moment of x\ a ^ (m; b). The Dilcher-Girstmair presentation 
of the S- and M-functions can be described as moments of the random variables 

(6.5) jf ) (m;b)=^ 1 J /|m: ^ < m, < ^±1^ } , i = 1, . . . , d, 
where I{m : . . . } is the indicator function. According to this definition, 

(6.6) ^(a;b) = J E P |njf a) (m;b)| 
and 

(6.7) M fc (a;6 l ) = J B P |(j i (a) (m;b)) fc |. 

6.2. Upper bounds for Sd(a; b). In the present section we use the random variables x\ a \m; b). 
Since a and b are fixed, we will simplify the notation to calling the random variables X\, . . . , Xj. 
Repeated application of the Cauchy- Schwartz inequality yields bounds in terms of the one-dimen- 
sional moments M. For example, for d = 2 we obtain 

E {X ± X 2 } < (E {Xj} E {Xl}f 12 , 

and thus 

S 2 {a;b u b 2 ) < (M 2 (a,b 1 )M 2 (a,b 2 )) 1/2 . 

For d = 3 we get 

E{X 1 X 2 X 3 }<(E{X 2 l }E{xlx!}) 1/2 < {E{XI}{E{X^}E{XI}) 112 ) 112 



or 



(6.8) S 3 (a; b u b 2 , b 3 ) < M^ /2 (a, &i)M 4 1/4 (a, 6 2 )M 4 1/4 (a, 63) • 

By taking the geometric mean of the cyclical permutations, we get the symmetric upper bound 

S 3 (a; 61, 6 2 ,6 3 ) < lnM 2 (a,& i )M 4 (a 1 6 i ) 
For d = 4 we similarly obtain 

(6.9) S 4 (a; h,b 2 , b 3 , 64) < I JJ M 4 (a, b 3 
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For d = 5 we start with 

S 5 (a;b) = E P {X 1 ---X 5 } 

< (E P {XlXl}) 1 ' 2 (E P {XIXIXI}) 1 ' 2 



5 * 1/12 



< (M 4 (a;6 1 )M 4 (a;6 2 )) 1/4 J{M^a;bj)M % {a;bj) 



\j=3 

Symmetrizing this upper bound by taking the geometric mean of the ( 2 ) different bounds obtained 
by different selections of pairs and triplets gives 

1/20 

(6.10) S 5 (a;b)< I JI M|(a; 6 i )M 8 (a; b 3 ) 
From the upper bound for 53 we immediately obtain 

1/12 

(6.11) S 6 (a;h)< I J[M 4 (a;bj)M 8 (a;bj) 
Generally, if d = 2k, k = 1, 2, . . . , we have 

(6.12) S 2k (a; b) < (E P {Xf ■ ■ ■ Xl}f 2 (E P {X* k+1 ■ ■ ■ X^f 2 , 
from which we get, by utilizing previous results, symmetric upper bounds. For example, 

(6.13) S 8 (a; b) < (M 8 (o; 61) • • • M 8 (a; b 4 )) 1/8 (M 8 (a; b 5 ) ■ ■ • M 8 (a; 6 8 )) 1/8 

= (M 8 (a;6 1 )---M 8 (a;6 8 )) 1/8 , 

and 

1/40 

(6.14) S w (a; b) < I J] M 8 3 (a; b j )M 16 (a; bj) 

We can immediately prove by induction the following: 
Lemma 11. 

/ 2* \ ^ 

(6.15) ^(o;b)<mM 2fc (a;6,) ,fe = l,2,... 



Similarly, for k = 0, 1, . . . 



u'=i 



\ l/6-2 fe 

'3-2 fc x 7 



(6.16) 5 3 .2fc(o;b)< JJ M 2fc+ i(a; 6 j )M 2fc+2 (a; 6j] 

and 



'5-2 



fe \ 1/20-2* 



(6.17) S 5 . 2fc (a;b) < [ JJ M 2 3 *+2 (°: 6j)M 2fc+3 (a;6j) 



20 

If d = 
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2k + 1 one needs a two-stage process of first partitioning to 



( ; ( i|v !) (' l iiv I) 

and then symmetrizing. 

Before concluding this section, we remark that the above upper bounds for the 5-functions arc 
generally not unique. By different partitions one can obtain different bounds. For example, in the 
case of S5, one could start with 



1/8 



E P {*!■■■ X 5 } < (E P {Xf}) 1/2 (E P {Xl ■ ■ ■ X 2 5 }) 1/2 = (M 2 (a; h)) 1 / 2 [ ]J M 8 (a; b 



1/2 



U=2 



After symmetrization we get 



1/10 



(6.18) 



S 5 (a;h) < \ Y[M 2 (a;b j )M 8 (a;b j 



U =1 



The question is which upper bound should be used, (6.10) or (6.18)? For example, if a = 31 and 
b = (3, 5, 7, 11, 13) then S 5 (a; b) = 1213.806. The upper bound given by (6.10) is 1321.321, whereas 
that given by (6.18) is 1456.985. In the following table we present some exact values of £5(0; b) 
and the two bounds (6.10) and (6.18). We also show R^(a;h), the ratio of 5s(a;b) to the upper 
bound (6.10). 



a 


b 


s 5 


bound (6.10) 


bound (6.18) 


#5 


31 


(3, 5, 7, 11, 13) 


1213.806 


1321.321 


1456.985 


0.9186 


21 


(5, 7, 9, 11, 13) 


4411.333 


4668.719 


5190.201 


0.9449 


23 


(5, 9, 11, 13, 17) 


11429.74 


12050.58 


13385.72 


0.9485 


27 


(5, 11, 13, 17, 21) 


28101.93 


29617.94 


33011.8 


0.9488 


33 


(7, 11, 13, 19, 23) 


51943.76 


54384.26 


60525.59 


0.9551 



Figure 8. Some values and bounds of S5 



It seems from Figure 8 that the upper bound given by (6.10) is closer to the exact value of £5(0; b) 
than (6.18). It is the preferred upper bound. It is also interesting that, like in the case of i?2(o; b), 
all values of i?s(a; b) in Figure 8 are greater than 0.9186. 
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6.3. Relationships to upper bounds revisited. We study now upper bounds to S d of the type 
given by (6.8)-(6.17). In particular, define 

(6.19) R 3 (a; b) = — , 



(n j 3 =1 M 2 (a,yM 4 (a,y) 



(6.20) RtUr-b) - 54( ° ;b) — 4 , and 



(6.21) R 5 (a;b) 



S 5 (a;h) 



(n"=iM 4 3 (a;6 J )M 8 (a;6 J )) 



1/20 



A few values of i?s(a;b) are given in in Figure 8. It seems that the minimal i?s(a;b) value is 
i?5(7;2, 3,4, 5,6) = 0.8567. It is also interesting to observe that R§, as shown in Figure 8, is 
generally above 0.9, as in the case of R 2 , despite the increase in dimension from 2 to 5. We try to 
explain this phenomenon in probability terms. 

As shown in (6.3) and (6.6), 

S d (a;b)=Ep{X l ---X d } = Ep{J 1 ---J d } . 
Consider the case d = 2. According to the law of iterated expectation [1], 

(6.22) S 2 {a-MM) =Ep{j (a Hm;b 1 )Ep{j^(m;b 2 ) | j( a )(m; , 

where the second term on the right-hand side is the conditional expectation of j( a )(m;62), given 
j( a )(m;6i). Notice that in the notation of Section 3, 

(6.23) S 2 (a;b,c) = ^«() = £X>/-*tf)E#4^ " 

The key to understanding the phenomenon is that the joint frequencies f a -b,c(j, are distributed 
along the main diagonal, as illustrated in Figure 2. In the special case that b = c, 



fa;b,c{j, I) 



fa;b(j) = fa;c(j) ^ j + h 

otherwise. 



In this case, 



and 



■y^j fa:b,c(ji Q _ ( 



() fa;b{j) 



1 &_1 

S 2 (a;b,b) = -Y,j 2 fa;b(j) = M 2(a;b) , 



3=0 



as expected. When b / c then R 2 is always smaller than 1, but might be quite close to it, even 
when b and c are different. For example, i?2(50; 7, 13) = 0.9955. 
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6.4. One-dimensional moments relationships. We present here some inequalities between 
M r (a; b) for fixed values of a and b and for variable r. 

Theorem 12. M r (a;b) is log-convex in r. That is, 

(6.24) M 2r (a; b)M 2r+2 (a; b) - M 2 2 r+1 (a; b) > 0. 

Proof. First, by Liapounov's inequality of moments [6, p. 627] we have 

Mi < M 2 1/2 < M 3 1/3 < . . . 
By factoring |_- • • J 2r+1 = |_- • • J 1 L- ■ ■ J T+1 we obtain the inequality 

(6.25) Mf r+1 (a; b) < M 2r (a; b)M 2r+2 (a; b) 

for all r > 1. That is, M r (a; b) is log-convex in r. □ 

Appendix 

Algorithm for Determining f£(i), i = 0, . . . , b — 1 

STEP 0: 
Set: 

(i) r = a/b; 

(ii) 1 = a-b* [r] ; 

(iii) f = 0(1, b); # b-dimensional vector of zeros 

STEP 1: 

Compute : 

f [0] <- l+[r]; 

f [ i ] <- [(i+l)*r]- [i*r], i=l , . . , (b-2) ; 
f [ b-1] <- a-1- [(b-l)*r] ; 

STEP 2: 

IF ((1=0) or (b=2)) { GOTO STEP 3 }; 
ELSE { 

FOR ( i=l, . . . ,b-2) { 

IF( [(i+l)*r] = r*(i+l) ) { 
f[i] <- f[i]-l; 
f[i+l] <- f[i+l]+l; 

} 

} 

STEP 3: 
PRINT f 



END. 
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Algorithm for Determining f£ c (i,j) 



STEP 0: 



Set: 



rl 


<- 


a/b; 


r2 


<- 


a/c; 


11 


<- 


a -b* [rl] ; 


12 


<- 


a - c* [r2] ; 


CT 


<- 


0((b+l),(c+D); 


tl 


<- 


0(b,l); 


t2 


<- 


0(1, c); 



# matrix of zeros, of dimensions (b+l)*(c+l) 



STEP 1: 



Compute : 

CT[i,(c+l)] <- f_b[i-H, i=l,...,b; 
CT[(b+l), j] <- f_c[j-l], j=l,...,c; 
CT[(b+l),(c+l)] <- a; 

CT[1,1] <- min( CT[l,c+l], CT[b+l,l]); 
t2[l] <- t2[l] + CT[1,1] ; 



STEP 2: 



Compute : 

FOR (i=2, . . . ,b) { 

CT[i,l] <- max(0, min(f _c [1] -t2 [1] , f_b[i-l]))); 

t2[l] <- t2[l] + CT[i,l] ; 

} 

tl <- CT[l:b,l] ; 



STEP 3: 



Compute : 

FOR (j=2,...,c) { 

CT[l,j] <- max(0,min(f_b[0]-tl[l] , f_c[j-l])); 
tl[l] <- tl[l] + CT[l,j] ; 
t2[j] <- t2[j] + CT[l,j] ; 
} 



STEP 4: 



Compute : 

FOR (i=2, . . . ,b) { 

F0R(j=2, . . . ,c) { 

cty<- max(0, min(f _b [i-1] - tl[i], f_c[j-l])); 

ctx <- max(0, min(f _c [j-1] -t2 [j] , f_b[i-l])); 

CT[i,j] <- min(ctx,cty) 

tl[ i ] <- tl [ i ]+ CT [ i, j] ; 

t2[ j ] <- t2 [ j ] + CT [i , j] ; 

} 
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> 

STEP 5: 
Print CT 

END. 
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